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ABSTRACT 


Sequences  of  order  symnetric  statistics  are 

examined  for  convergence  in  law.  Under  appropriate  condi- 
tions, a limiting  distribution  exists  and  is  equivalent  to 
that  of  a linear  combination  of  products  of  Hermite  poly- 
nomials of  independent  N(0,1)  random  variables. 

Connections  with  the  work  of  von  Mises,  Hoeffding,  ^uld 
Filippova  are  noted. 

AMS(MOS)  Subject  Classification:  Primary  60F05j  Secondary  62E20. 

Key  Words:  Symmetric  statistics,  Hermite  polynomials,  von  Mises 
statistics,  U-statistics. 

Work  Unit  No.  4 -Probability,  Statistics  and  Combinatorics 

Explanation 

In  probability  and  statistics  one  often  forms  symmetric  functions  of 

a set  of  observed  random  vari2d>les  X,  ,...,X  . Common  exan5)les  are  the 

1 n 


X,+. . .+X 


sample  mean  X^  ■ ^ and  the  sample  variance  I ^*i"^n^  ’ 

It  is  of  interest  to  know  vrtiether  such  quantities  possess  a limiting 
distribution  as  the  sample  size  gets  very  large.  In  this  report 
we  present  a quite  general  result  describing  the  nature  of  such  asymptotic 
distributions.  A special  case  is  the  classical  central  limit  theorem  for 
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ASYMPTOTIC  DISTRIBUTION  OF  SYfMUTRlC  STATISTICS 


I 


H.  Rubin  and  R.  A.  Vitale 


1.  Introduction. 


Synmctric  statistics  arise  naturally  in  a variety  of  contexts  and  accordin9ly  have  been 
investigated  from  several  points  of  view  (see,  for  instance,  von  Hises  |7),  Hoeffding  14), 
Filippova  (2),  and  Rubin  emd  Sethuraman  (8)).  Our  purpose  here  is  to  provide  a unified  ap- 
proach to  their  asymptotic  behavior  in  law  by  exploiting  an  orthogonal  expansion  technique. 
This  method  is  implicit  in  parts  of  (7)  where  it  is  apparently  employed  in  an  qd  hoc  fashion, 
A systcnetic  development,  as  we  shall  see,  can  bo  pursued  to  obtain  quite  general  results 
and  in  addition  offer  added  insight  into  the  structure  of  symmetric  statistics.  The 
central  role  of  Hermite  polynomials,  for  instance,  has  not  been  noted  before  to  the  l<i*st  of 
our  knowledge. 

II.  Recasting  a Symmetric  Statistic. 

It  will  be  convenient  to  consider  symmetric  statistics  in  a canonical  form  which  we 
develop  in  this  section. 


Let  us  consider  random  variables  X,,...,X  which  are  iid  and  a symmetric  statistic 

1 n 


Z » f(X,,...,X  ).  (The  reader  should  note  that  subsequent  discussions  do  not  require  the  X 
In  * 

to  be  real  valued.  Inasmuch  as  we  deal  only  with  functions  of  the  X^,  the  latter  may 
be  random  elements  of  any  appropriate  space.  The  present  study  was  in  fact  motivated  by  a 
consideration  of  functionals  of  random  sets  (Vitale  111)).)  We  assume  that  Z is  of  order  m 
- that  is, 

(1) 


k=0  1 k 


where  the  irmer  sum  is  over  a set  of  indices  closed  under  permutations  of  l,2,...,n.  Be- 
cause of  this  symmetry,  we  may  assume  that  the  Sj^  themselves  are  symmetric.  Moreover, 


since  an  evaluation  of  , Sj^  over  repeated  arguments  may  be  moved  to  a smaller  k (together 


with  its  symmetritations),  let  us  assume  that  this  has  been  done  and  that  each  inner  sum  is 
hollow,  i.e.,  over  a set  of  distinct  indices. 


Since  the  are  symmetric,  we  may  express  (1)  as 


(2) 


k-0  E.  k 

k 
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where  E|^  > (i^t  ...  , 1^)  is  e k elenent  subset  of  !«..., n,  - (X^  , ...  • ), 
and  " kiS|^(*).  By  the  elegant  device  of  introducing  conditional  expectations  of 
different  orders,  Hoeffding  [5]  has  shewn  that  under  suitable  conditions  (2)  can  be  re- 


written as 


'•I  £ %<\> 

where  the  reseoble  the  but  satisfy  additionally 

® ’Sc**! Vl-’Sc’  " ° 


and  hence 


if  kf/k'  or  if  k - k'  and  . 

Let  us  derive  this  representation  in  a somewhat  different  way  under  the  assumption  of 

square  integrability  of  the  . Without  loss  of  generality  we  may  assume  that  the  square 

integrable  functions  of  X.  form  a separable  Hilbert  space  with  orthonormal  basis  > 1, 

k 

W.  (X.),...  . Then  the  system  of  functions  { n (X.)}  forms  an  orthonormal  basis  for 

j-1  1 ^ 

the  Hilbert  space  of  square  integrable  functions  over  the  product  space  (X^,...,X^). 

In  the  expansion  for  T (X  ) ve  isolate  those  terms  in  which  the  basis  eleswnt  has  <fi 

k Ek  o 

occurring  as  a (k-h)-fold  factor,  0 h ^ k.  With  the  intrcxluction  of  a symmcttic 

function  of  h arguments,  this  contribution  can  be  written  as 


so  that 


h 


• Jo  4.0^  "“'x’ 


L I X "khV’‘ 


k-O  h-0  E^CE^ 

Changing  the  order  of  sunnation  yields 


I I I I 

k-0  E^  h-k  E^E^ 


**hk<\> 


EB(>loying  the  orthonormality  of  the  v>^,  one  can  verify  that 
E Whk**l'-'-*Vl'V  " ° 

BO  t**  t (3)  is  Hoeffding* s deconposition  with  the  assignment 


-2 


•‘V  • Jc> 


Finally,  naking  use  of  tha  representations  for  the  W..  , we  find  it  convenient  to  re-write 

l\K 

(3)  as 

m k 

«)  * - «0  * ^ ^ j ‘li  i ^ •••  I " ^ <*n  > 

k-1  i^/0  ‘l*-‘k  n^  n^  j-l  S "j 

for  some  coefficients  q . The  sero-th  order  ter*  has  been  isolated  so  that  i^  F 0,  and 
consequently  E F,  (X  ) > 0,  throuqhout. 

HI  Asymptotics 

Me  shall  consider  the  asyiqptotic  behavior  of  a sequence  of  order  ■ symnetric  statis- 
tics described  as  follows  (the  reader  will  recognise  expressions  fro*  the  previous  section 

with  additional  dependencies  on  n carried  by  left  subscripts) i for  each  n let  X. 

n 1 

i > l,...,n  be  iid  and  serve  as  arguments  for  the  symsietric  statistic 


■ J ^ S ( X.  X,  ) 

k“0  n k n ij  **  Si 


Oiollow  representation) . We  assume  t)iat 


E S*(  X„  ) < « 
n k n Ek 

so  that  Z can  be  written  as 
n 


k « 0, . . . ,m 
n ■ 1,2, , .. 


n 0 


I I •••  I ...i^  ^ ^ 


t-1  i^FO  V 


k 

n 

"i’^j 


F.  ( X_  ). 
n ij  n n^ 


Let  us  recall  t)>at  terms  of  this  expansion  are  orthogonal  except  t))at  two  arising  from 
different  permutations  of  i^,...,ij^  are  identical.  An  elementary  but  elaborate  calcula- 
tion depending  on  this  consideration  s)iows  that 


(5) 


Var(  Z)  - I kl  i ...  I 

k-1  ijFO  i^FO 


nl 


(n-k)  I n^i,...i. 


This  suggests  a convenient  normalization  within  each  term:  Since 

r . - q and 

n n ij...ij^ 


nl 


(n-k)  I 


~ n , let  us  set 


n li***Si 


n F,  ( X ) 
\ j-l  " J " "j 


so  t)iat 


"l»^J 
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w - n‘0  - ^ I - ^ 

k-lVO  i^,.0 

Ne  shall  prove  the  follawln9. 


r.  1 V.  , . 


Theorem.  Let  r.  < '*'  let  the  series  (S)  for  Var(  z)  be  uniformly  con- 

n ij...i^  ij...!,^  n 

wrgent.  For  each  i > 0 and  c > 0,  let 


,fiW  - » 


n + -.  Let 


denote  a sequence  of  iid  standard  notwal  variables.  Then 


Z - a converges  in  distribution  to 

Q n U a • 

m • 

(8)  Z-  X I •••  X i n H ,(G) 

k-1  ij>K)  lj^l«0  I"'  k j-1  ''‘‘'3'  3 

where  is  the  u-th  Hermite  polynomial  and  v(i,j)  is  the  nusber  of  times  j occurs 

in  the  sequence  i - i^ , . . . , i^ . 

►roof.  Ne  first  observe  that  a standard  truncation  argument  can  be  invoked s by  virtue 

of  the  uniform  convergence  of  the  scries  for  Var(  Z) , (6)  can  be  truncated  to  a finite  series 

n 

with  a uniformly  small  mean  square  error  (in  n ).  The  terms  of  Z share  the  "orthogonal 

or  identical”  property  with  those  of  Z-  z.  and  soreover 

n no 


Var(Z)  - y.  kl  y ...  y r^  . . 

k-1  i^i<o  ij^i«o 

These  facts  are  sufficient  for  the  convergence  in  law  of  truncations  to  imply  the  theorem 


(Mann  and  Wald  |6]). 

To  show  convergence  of  truncations  it  is  enough  to  verify  that  the  quantities 
n 3 n 3 n t 

are  asyag>totically  independent  standard  normal  variables  and  that  for  each  i,,...,i 

m ^ 

V,  , - n H ,,  .,  ( Y.)  -♦  0 in  probability, 

n ij^...ij^  v(i,3)  n 3 ' 

The  first  result  holds  since  the  Y.  are  already  orthonormal  and  in  (7)  we  have 

n j 

assumed  the  required  Lindeberg  condition. 


To  prove  the  second  result  we  apply  the  lemma  in  the  appendix  to  V. 

n i. 


He  see 


T({a  ,...,a  })  - n"^^^  y <p  (X  ) ...  »>,  (X  ) . 

1 n . n t n t 


3^  h “ 1,  this  is  precisely  Y^  . If  h ■ 2,  the  Lindeberg  condition  suffices  to  show  that 


-4- 


T((a,b))  -»  1 in  probability  if  ■ <f.  and  T((a,b))  -»  0 in  probability  otherviaa. 

n ia  n i,^ 

Also,  by  that  condition,  max  In  ^ ^,(X.y|  -*  0 in  probability,  so  that  all  hiyher  order 

* ^ a.  ^ * Hit* 

1 ^ t £ n 

(>2)  terns  vanish  asynptotically.  As  a result  we  need  only  consider  partitions  into  1 
and  2 eleoient  sets  and  if  {a,b)  « A , ^ * Thus  asynptotically 


- n H 


,<  V.) 


n’ij...i^  ••  "v(l,j)'n‘j' 


where 


‘'v(i,j)‘’'> 


|.1,P(*»  v(i,J)-2ptf) 


i "" 

with  ranging  over  all  partitions  of  (l,2,...,v(i, j)}  into  2-elenent  sets,  of  «<hich 
there  are  p(fi)  , and  l-element  sets,  of  which  there  are  v(i,j)  - 2p(£)  . But  for  each  a , 
the  number  of  partitions  6 with  p(£)  >0  is  precisely  the  coefficient  of  the  appropriate 
Hemite  polynomial. 

IV  Low-order  Cases. 

The  result  sinplifies  if  r,  , “ 0 for  k > 3.  If  only  first  order  terns  are 

present,  then  we  have  the  asymptotic  normality  results  of  Hocffding  (4]. 

If  there  are  only  second  order  terms  present,  then  we  have 

2 ■ I f,)  CjCj  ■ 

(if  we  diagonalize  the  quadratic  form  ^ ^i^j*  off-diagonal  terns  vanish).  This 

corresponds  to  results  of  von  Mises  [7]  and  Filippova  [2]  except  for  the  centering  in  the 
diagonal  terms.  This  allows  us  to  avoid  the  unnecessary  assumption  r^^l  < **  • 

If  both  linear  and  quadratic  terms  arc  present,  then  after  diagonalization  (8)  becomes 
I Pj  (cj-l)  . 

which  is  a convenient  form  for  computation.  This  case  occurs  in  the  case  of  Pitman  alterna- 
tives where  the  null  hypothesis  yields  only  quadratic  terms. 

He  note  that  in  each  of  these  cases  an  expansion  of  Z can  be  exhibited  which  has 
Independent  terms.  Unfortunately  these  considerations  cannot  be  extended  to  higher  order. 
For  exanple  the  third  order  form 

Gj  - 3Gj  + Gj(G2  - 1)  - HjCGj)  + Hj^(Gj^) 

• • • • 

cannot  be  written  as  aH^(G^)  4 QH^CG^)  with  independent  Gaussian  G^  and  G^  . 


-5- 


V,  Additional  Remarks. 


The  results  of  von  Mises  (7]  and  Filippova  (2)  are  couched  in  tenu  of  full  sum. 
Their  assumptions,  however,  are  sufficient  to  allow  hollow  sub  doco«f>ositions  and  the  use 
of  our  method. 


in 


We  note  that  the  square  inteqrability  condition  is  not  necessary  for  our  re<-ilt  to  hold 
some  cases.  For  example,  let  us  consider  Z ■ n”  J ***  where  X has  Man  o 


and  variance  1.  Obviously  we  have  z . - “a*®!*  * ^ Hj(Cj).  However,  the  decoinxjsitlon 


of  Z is 
n 


(9) 


.-3/2 


^n  *n  * 3(n-l)n  [ X 

n^  n^  n c 

"i’^j 


+ 3n‘^''^  II  (X^  -1)X  ♦ lixl  - exh  ♦ n"^^^Exf 

■a  ^ ^ * 


ni/n^ 


1 2 

Now  the  first  two  terms  are  what  make  the  theorem  work.  However,  the  assunption  of  square 
integrability  of  each  term  fails  even  at  the  third,  which  does  not  have  a finite  variance 
unless  the  fourth  moment  of  Xj^  exists.  Obviously  our  condition  is  too  strong,  but  we 
do  not  have  an  appropriate  weakened  form.  Note,  incidentally,  that  <9)  shows  how  coefficients 
in  the  decomposition  can  exhibit  a dependence  on  n even  before  tens-wise  normalization  has 
been  done  (4) . 

Under  the  conditions  imposed,  we  can  say  nothing  about  the  rate  of  the  indicated  con- 
vergence. The  stronger  assumption  of  third  moments  should  be  sufficient  to  allow  the  use  of 
Mthods  of  Sazonov  19,10]  thereby  generalizing  his  result  on  the  rate  of  convergence  for  the 
Cramer-von  Mises  statistics.  Moreover,  moderate  deviation  results  can  probably  be  obtained  under 
appropriate  assumptions  and  the  use  of  techniques  developed  in  Dirkse  (1),  Funk  13),  and 
Rubin  and  Sethuraman  [8] . 

Appendix.  The  evaluation  of  a hollow  sum. 

Then  we  show  that 


o - I ...I  a a 


"k’ 


(Al) 


O - J n C(E)  T (E)  , 

EC* 

tfhere  A ranges  over  all  partitions  of  { l,...,k}  into  non-empty  disjoint  sets,  C{E) 


.E-1 


(-1)  (E-l)l  (E  - number  of  elements  of  E) , and 


(E)  - [ n 


t i*l 


‘it 


-6* 


This  is  saslly  established  by  induction.  For  k ■ I this  states  that  J a 


and  for  k > 2 that 


Mow 


»<  I 2 


lt®2t 


» ■ Z •••  I “in  •••  “k+i, 


•1  ^*V+1 

“i’^J 


■ <z  •••  z ® 

"l  % 
"i****! 

- ZZ  ...  z 


In, 


®k4l,t^ 


P n 


In 

"i'^j 


II  . * • Q O 

"in^  kn^  Vl.n^ 


- To 

It  ^ It 


(A2) 


- t({k+l))  y n C(E)i(E)  - I I,  n C(E)T(E*  ) , 
*•£«*•  p ® E»  fif  ** 


where  E'  - E <J{)c+l)  if  p « E and  E*  - E otherwise,  and  P*  ranqes  over  the  partitions  of 
P P 

{l,...,k)  . 

Let  us  identify  contributions  of  (A2)  to  (Al)  written  over  partitions  £ of 

We  separate  into  two  cases.  If  (k+l)  f£  then  the  relevant  tom  of  (Al)  is 

T({k+l))  n*  C(E)t(E)  which  agrees  with  the  first  tersi  of  (A2).  In  the  other  case,  let  F 
E(  £ 

E/Ck+l) 

be  the  element  of  £ containing  k+1  and  p other  elements.  Then  this  term  of  (Al)  occurs 
p times  in  the  second  term  of  (A2)  as  -H  C(E~(k+l))T(Ep)  and  thus  C(F)  ■ -PC{F'“  (k+D) , 
which  establishes  the  conjecture. 
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equivalent  to  that  of  a linear  combination  of  products  of  Hermite  polynomials 
of  independent  N(0,1)  random  variables.*. 

Connections  with  the  work  of  von  MFtees,  Hoeffding,  and  Filippova  are  noted. 
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